Enhancing Grammatical Cohesion: Generating Transitional Expressions for SMT
نویسندگان
چکیده
Transitional expressions provide glue that holds ideas together in a text and enhance the logical organization, which together help improve readability of a text. However, in most current statistical machine translation (SMT) systems, the outputs of compound-complex sentences still lack proper transitional expressions. As a result, the translations are often hard to read and understand. To address this issue, we propose two novel models to encourage generating such transitional expressions by introducing the source compoundcomplex sentence structure (CSS). Our models include a CSS-based translation model, which generates new CSS-based translation rules, and a generative transfer model, which encourages producing transitional expressions during decoding. The two models are integrated into a hierarchical phrase-based translation system to evaluate their effectiveness. The experimental results show that significant improvements are achieved on various test data meanwhile the translations are more cohesive and smooth.
منابع مشابه
Investigating Grammatical Cohesive Devices: Shifts of cohesion in translating narrative text type
Abstract This study focused mainly on the shifts of the grammatical cohesion in texts translated from English into Persian. It aimed to identify the grammatical cohesive devices (GCDs) in ST and TT separately, based on Halliday and Hassn's Model (1976), determine the number of occurrences of GCDs in two texts and finally, illustrate types of shifts of grammatical cohesion and strategies used in...
متن کاملInvestigating Grammatical Cohesive Devices: Shifts of cohesion in translating narrative text type
Abstract This study focused mainly on the shifts of the grammatical cohesion in texts translated from English into Persian. It aimed to identify the grammatical cohesive devices (GCDs) in ST and TT separately, based on Halliday and Hassn's Model (1976), determine the number of occurrences of GCDs in two texts and finally, illustrate types of shifts of grammatical cohesion and strategies used in...
متن کاملConstrained Grammatical Error Correction using Statistical Machine Translation
This paper describes our use of phrasebased statistical machine translation (PBSMT) for the automatic correction of errors in learner text in our submission to the CoNLL 2013 Shared Task on Grammatical Error Correction. Since the limited training data provided for the task was insufficient for training an effective SMT system, we also explored alternative ways of generating pairs of incorrect a...
متن کاملSubstitution as a Device of Grammatical Cohesion in English Contexts
The present study set out to investigate the effect of teaching substitution as a kind of grammatical cohesion on the true identification of confusing substitution elements with cohesive or non-cohesive roles in different contexts and also the production of modal, reporting and conditional contexts through clausal substitution acquaintance. To this end, the following procedures were taken. Firs...
متن کاملDiscriminative Reranking for Grammatical Error Correction with Statistical Machine Translation
Research on grammatical error correction has received considerable attention. For dealing with all types of errors, grammatical error correction methods that employ statistical machine translation (SMT) have been proposed in recent years. An SMT system generates candidates with scores for all candidates and selects the sentence with the highest score as the correction result. However, the 1-bes...
متن کامل